Introduction

Welcome to my portfolio for the course computational musicology!

My name is Sanne Veltman, and for this course, I am analyzing the music of my roommate. Specifically, I will be examining two tracks by the artist HONEY: Ga Maar and Niks Meer Over. These songs explore significant themes in life. I selected these tracks not only because I find them musically compelling and unique but also because they convey important messages that deserve attention.

HONEY’s music is truly inspiring, as it addresses topics that are often surrounded by taboos or societal stigma, such as neurodiversity and mental health. Personally, I believe it is crucial to discuss these subjects, especially as mental well-being is an important aspect of my own life. The way music intertwines with these themes makes the listening experience even more profound.

The two tracks I have chosen are in Dutch and differ significantly from one another, as I will demonstrate in the analysis within this portfolio. I highly recommend listening to them—they are certainly worth your time!

Enjoy!

Visualization class corpus versus own tracks


What is visible? in this visualization you see the class corpus based on arousal, tempo and the colors are the danceability. The two red dots show my tracks within the visualisation. The tempo of both of tracks are, This is very interesting to see, it is like a worldmap with all of the tracks from the students! The tempo of my tracks are correct in this visualisation. If I look at the tracks of the other students, I don’t see a really significant correlation between tempo or arousal. But if the tempo goes down, you can see that the danceability gets a bit lighter in its color. There could be a possible correlation between tempo and danceability here.

Histogram class corpus versus own tracks


What is visible? In this class corpus histogram there is an peak at 150 bpm. I found this interesting and I think it is also mistaken because most of the tracks of the other students are less than 150 in tempo as you can see in the visualisation above.

Personal value visualisation


What is visible? This plot provides a visualization of the musical features of my own tracks. I found this representation particularly insightful in enhancing my understanding of their characteristics. The values presented in the visualization appear to be accurate and align with my observations.

Track 2 exhibits a higher danceability score than Track 1, suggesting that it is more rhythmically engaging and better suited for dancing. Both tracks demonstrate similar arousal levels, indicating comparable intensity and energy. The differences in engagingness and instrumentalness are minimal. The similarity in engagingness suggests that both tracks are equally captivating, while the instrumentalness scores indicate a comparable presence of instrumental sections in both compositions.

A notable difference is observed in tempo, with Track 2 having a significantly higher tempo than Track 1. Additionally, both tracks display nearly identical valence scores, implying a similar emotional tone. However, Track 1 has a slightly higher valence, meaning it conveys a somewhat more uplifting mood compared to Track 2. From a personal perspective, this observation aligns with my listening experience of both tracks.

Chromagram

Track 1


Track 2


What is visible? A chromagram is a visual representation of the pitch content of my tracks. This visualization shows how different pitch classes evolve over time.Track 1 is 180 seconds and track 2 approximately 160 seconds. Track 1 is in key D major and track 2 is in key G minor. The fact that these tracks are in D major and G minor is visible within the chromagrams and I checked this with my roommate and this is confirmed! The chromagram did it right!

Ceptogram

Track 1


Track 2


What is visible?

These ceptograms provide a visual representation of how the spectral envelope of the audio signal changes over time. This is useful for analyzing the timbre and harmonic structure of my tracks. Both tracks have a dominant energy at the lower coefficients, the bottom part of the plots. The green background in both plots suggests a relatively stable spectral envelope. This means that there is an variation in timbre. The first track show low coeffients alternating yellow and dark blue, from my interpretation this means that there is a low frequency harmonics. The second tracks also have low yellow bands, which means from my interpretation that this is more continuous and stable in timbre and harmonics.

Chroma based self - similarity

Track 1


Track 2


What is visible? Track 1 consists of six distinct themes. The first theme is characterized by three repetitions, while the second theme serves as a transition between the first and third themes. At approximately 134 seconds, the fourth section concludes, and the fifth section begins. This transition is also audibly noticeable, as the track shifts from an acappella section to vocals accompanied by instrumentation. Following the fifth section, the track concludes with an outro.

Track 2 is structured into three larger sections and six smaller segments. The first section lasts approximately 50 seconds, the second extends to around 110–120 seconds, and the third continues until the end of the track. A repetition occurs at the end of the first section. Within the second section, there are two smaller thematic segments around the 90-second mark, one of which features a repetition similar to that in the first section. Additionally, after 50 seconds, the distinction between the first and second themes is emphasized through the use of an explicit word..

Key estimation

Track 1


Track 2


What is visible? These key estimation plots visualize how different musical keys are activated throughout the tracks. Each row represent a key, which can we major or minor, and the colors indicate the strength of the presence of that key over time. From my interpretation, both tracks have a similar spread of key activation, with variations occurring over time. The yellow areas indicate stronger activation and this means that those keys are more dominant at those moments. Track 1 in D major and shows a stronger presence of certain keys at the beginning, this is the left side of the plot. Track 2 is in G minor is a bit more consistent in key activation. The yellow bands are more evenly spread compared to track 1. In track 1, something interesting is happening around 130 seconds, there are two dark lines and yellow in the middle. If you listen to the track, more vocals are added to the track. In track 2, something is happening around 50 seconds, this is again when the explicit word is been said and the intensity of the song goes up. There could also be a shift in key at that point. The emotional tone of the tracks are also different. Track 1 is a bit more dramatic and track 2 more uplifting.

Chord estimation

Track 1


Track 2


What is visible? The first track is in D major, while the second track is in G minor. In Track 1, the notes D, E, F#, G, A, B, and C# are present, which correspond to the D major scale.

The tonal structure of the second track is more complex. This complexity arises from the presence of B-flat in the melody. However, G is also prominently featured, suggesting a G minor key, as G minor and B-flat major share the same notes. Additionally, the note D appears more prominently than F. If F were more dominant, the track would likely be in B-flat major. However, since D is more pronounced, it is reasonable to conclude that the track is in G minor.

Timbregram

Track 1


Track 2


What is visible? In Track 1, a clear segmentation is observed, with noticeable transitions occurring around the 50-second and 134-second marks. These transitions suggest distinct sections within the track, likely corresponding to changes in musical or instrumental themes. In particular, the transition at approximately 134 seconds marks a significant shift in the composition, as the track moves from an acappella section to multiple vocal layers accompanied by instrumentation.

In Track 2, the timbre analysis reveals the presence of twelve smaller thematic segments. The first two smaller segments together form a single theme, while the third and fourth segments constitute another distinct theme. Themes 5 through 9 can be grouped as one larger section, although they could also be further subdivided into two smaller parts. Towards the end of the track, the composition is structured into three overarching sections.

Energy novelty

Track 1


Track 2


What is visible These graphs represent energy novelty over time in seconds. The graph from the first track has a high peak surpassing 2.0 The highest peak in the second graph is below 1.0, meaning that the most intense energy change is less extreme compared to the first graph. In the first graph there is a prominent spike after 150 seconds while the second graph does not have not such a strong event toward the end. The overall energy novelty values in the second graph appear lower, meaning there are fewer dramatic changes in energy compared to the firs graph. The first graph has more periods of very low activity between bursts, while the second graph maintains a more consistent level of fluctuations. In the second track there is a high peak around 50 seconds, this is again explicit word in the track.

Tempogram track 1


What is visible? In the first tempogram is a tempogram of track 1, shows multiple tempo candidates across different BPM values. It a more complex structure, this is possibly reflecting tempo ambiguities or polyrhythms within the track. The tempo appears less stable, with more fluctuations over time. This suggests the analysis does not assume periodic repetition, leading to more variability in detected tempo values. The second tempogram is the cycles tempogram version of track one, displays a clearer more stable tempo band. It shows periodic patterns, meaning the analysis assumes a cyclic structure in the music. The dominant tempo line is stronger and more continuous, reflecting the assumption that rhythmic patterns repeat over time. In the non - cyclic case, the analysis does not assume repeating rhythmic patterns, leading to more tempo variations. In the cyclic case, the analysis enforces periodicity, resulting in a more stable and structured tempo variation. This suggests that the piece has some level of rhythmic repetition. The actual BPM of the first track is 87 BPM.

Tempogram track 2


What is visible This tempogram is a version of track 2. According to the tempogram there are multiple strong tempo candidates. Several horizontal bands at different BPM values indicate multiple tempo interpretations. There is a sudden transition around 50 seconds, there is a sharp change in the structure which is possibly marking a change in musical section or rhythmic pattern. There is a higher tempo ambiguity in the first section. The first half, before the 50 seconds, shows a more dense and fluctuating tempo pattern. This suggests rhythmic complexity or variations. In the second part of the tempogram there is a more stable tempo. After the transition, the dominant tempo lines become clearer and more stable. This indicates a more regular rhythmic pattern. The actual BPM of the second track is 127 BPM.

Hierachical clustering


What is visible? For this hierarchical clustering visualization I found my own name within the clustering and compared the tracks next to me with my own tracks. I found it really interesting to see and also to listen that these two tracks of my peers were clustered with my tracks because if you listen to my tracks and the tracks for my peers, they are completely different to eachother.

Classification AI and non-AI

          Truth
Prediction AI Non-AI
    AI     35     17
    Non-AI 14     24


What is visible This visualization is a confusion matrix that compares AI vs. Non-AI predictions against the actual truth labels. AI correctly classified as AI is 33 times, this is a true positive. Non - AI correctly got classified as Non - AI 22 times, this is a true negative. AI misclassified as Non-AI 16 times, this a false negative and Non-AI got classified as AI 19 times, this is a false positive.

# A tibble: 2 × 3
  class  precision recall
  <fct>      <dbl>  <dbl>
1 AI         0.673  0.714
2 Non-AI     0.632  0.585
# A tibble: 2 × 3
  class  precision recall
  <fct>      <dbl>  <dbl>
1 AI         0.660  0.714
2 Non-AI     0.622  0.561

Classification corpus


What is visible This bar chart visualizes the importance of different musical features within the class corpus of 2025. According to this bar chart, the most important features are instrumentalness, suggesting that the presence or absence of vocals plays a significant role in the analysis. danceability and arousal follow closely. This indicates that rhythm and energy levels are also crucial factors. Valence has a slightly lowe importance compared to arousal and danceability. The least important feature according to his bar char is tempo. I found this an interesting outcome of the bar chart.

Classification corpus AI and non - AI danceability and arousal


What is visible The AI - generated tracks and non- AI generated tracks are spread over the plot. The most tracks have are between 3.5 and 5.5 in arousal and are concentrated around 4.0 for arousal and danceability 4.0 - 4.5. The most AI generated tracks are slightly higher in danceability. There is not really a dominant bpm within this plot, all the tracks are between 140 - 160 bpm.

Conclusion

This assignment for Computational Musicology has been an educational adventure in which I gained many new insights. It was particularly interesting to apply various analytical methods and gain a deeper understanding of the two selected tracks.

The most valuable learning experience for me was the tempogram analysis, as it provided a fascinating perspective on the rhythmic structure and dynamics of the tracks. It significantly enhanced my understanding of musical time patterns. Additionally, I found it intriguing to see how different visualizations – from ceptograms to key estimations – all contribute to a richer musical interpretation.

I also found it particularly interesting to see where my tracks were positioned within the visualizations compared to those of my fellow students. This comparative aspect added an extra layer of insight, allowing me to understand the unique characteristics of my tracks in a broader musical context.

I am very pleased with the outcomes of the various analyses and visualizations, and I will certainly take this experience with me into my master’s studies. Finally, I would like to thank HONEY for allowing the use of their tracks and, of course, express my gratitude to the professor and student assistants for their valuable guidance and support throughout this assignment.

Sanne Veltman